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1. INTRODUCTION 

Cooperative control of multi-agent systems (MASs) has received much attention from the research 
community in recent years. One active area in MASs is achieving state or output consensus among agents 
[1]-[4], and others, wireless sensor networks [5]—[7] and others, and task scheduling [8], [9], and others. One 
extension of consensus is multi-agent formation. In the literature of multi-agent formation, the formation is 
produced by introducing a bias to the consensus control law of each agent, see for example [10]-[15]. The 
bias is introduced so that the states or outputs of agents differ by the bias amount. Another formation control 
problem is by considering obstacle or collision avoidance [16], [17] and multi-robot navigation [18]—[22]. 
However, in the approaches mentioned above, the formation size cannot be controlled or changed during the 
operation of MAS. The environment where MAS operates may change over time, for example, MAS may 
encounter a path narrower than its current formation size. In this situation, the ability to adjust the formation 
size is essential for MAS in continuing its operation. 

Past works on formation control with formation size adjustment (scalable formation problem) are 
sparse. The works Coogan and Arcak [23] and Coogan et al. [24] solve the scalable formation problem of 
double integrator agents by introducing an auxiliary state that acts as a multiplier to the bias in the standard 
formation control law. In one of the methods discussed by [24], the auxiliary state is updated by using a 
consensus algorithm. Whereas in [23], the auxiliary state is updated by estimating the desired formation 
scaling factor (known only by leader agent (s)) by monitoring the relative position of agents’ neighbor. In 
[23], both the single link and multi-link method in monitoring the neighbors’ position is discussed, while in 
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[24], only the single link method is discussed. When the auxiliary state reaches consensus, all agents have 
similar multiplier to their formation parameter. Thus, the whole group reach a formation where its size is a 
multiplication to the consensus value of the auxiliary state. In [24], the leader-follower network is used so 
that the consensus value of the auxiliary state is dictated by the initial value of the leader agent’s auxiliary 
state. The work [25] discusses scalable formation for single integrator and double integrator agents by using a 
self-loop communication weight. The formation parameter is embedded in the self-loop communication 
weight. However, the communication network is considered to be fixed. 

Scalable formation control considering identical linear system agents is discussed in [26]. In [26], a 
consensus-based scalable formation algorithm is used as a dynamic compensator. Similar to [24], the work 
[26] also uses auxiliary state as the multiplier to the formation parameter. In addition to the scalable 
formation, the work [26] allow for orientation adjustment. This additional adjustment is achieved by adding a 
variable multiplied to the desired position in the dynamic compensator. The works [23]-[26] consider fixed 
communication networks and identical agents in their formulation. Moreover, the network used to update the 
auxiliary state (formation scaling factor variable) is a leader-follower network. Scalable formation for 
heterogeneous agents under time-varying network can be found in [27]. The work considers continuous time 
agent model and leader-follower network. 

Unlike the works [23]-[26] that consider scalable formation for identical agents under fixed 
network, this work considers scalable formation for non-identical agents under a_ time-varying 
communication network. Moreover, unlike the work [27], this work considers leaderless undirected network 
in addition to leader-follower network in updating the formation scaling variable. Non-identical agents’ 
consideration is important in the formulation of MAS since not all agents are made similar; and also, 
non-identical agents formulation allows for different agents to cooperatively work under similar network 
(for example, drone and ground robot). Time-varying communication consideration is important since the 
communication network among agents may be changing during the operation due to weather, packet drop, 
and signal loss. The time-varying network formulation is important to guarantee the MAS algorithm to work 
under these situations (under some common network assumption, for example jointly connected network). 

Leaderless network formulation is needed on a situation where leader-follower network is not 
suitable for MAS operation. An example is when the formation leader (the agent that determine the formation 
size) must be changed during the operation. In scalable formation under leader-follower network, the leader 
agent is the formation leader, and the leader agent does not receive information from other agents. Suppose 
agent q is the leader agent in the network. When the formation leader is changed to another agent; since there 
is no information coming to agent q, then the whole agents cannot reach consensus (spanning tree condition 
is violated). Formation leader change during the operation is needed when the formation leader experiences 
faulty which makes it unable to continue its purpose. 

The control input in this work is based on the internal model principal approach described by [3]. 
The internal model principle approach is to devise a reference output which will be tracked by the output of 
each agent. A bias is then introduced in the reference output so that agents track different output and hence 
reach a formation. Moreover, a virtual state is multiplied to the reference output for the purpose of resizing 
the formation. Two network cases in updating the virtual state, leader-follower and leaderless undirected 
(both are time-varying networks), are discussed in this work. In achieving scalable formation for leaderless 
undirected network, the result from [28] is used to estimate the consensus time duration. This time duration is 
needed by the formation leader to know when it can change the formation size into the new one. It is shown 
that the consensus time duration is a function of the desired error bounds between the desired and actual 
formation scaling factor. The proposed work is expected to contribute in the application of search and rescue 
[29]-[32], area surveillance [33], [34], and others. 

Non-negative and positive integer sets are indicated by Z$, and Z* respectively. Let M,L E€ Z* with 
M > L. Then Z“ := {0,1,2,---,M} and Z“ := {L,L + 1,---,M}. Meanwhile, R, R”, R"*™ refer respectively 
to the sets of real numbers, n-dimensional real vectors and n by m real matrices. Ip is the n X n identity 
matrix with 1,, being the n-column vector of all ones (subscript omitted when the dimension is clear). Given 
a set C, |C] denotes its cardinality. The transpose of matrix M and vector v are indicated by M’ and v’, 
respectively. Additional notations are introduced when required in the text. 


2. RESEARCH METHOD 

In this section, the problem will be formally presented, and the proposed distributed algorithm will 
be discussed. The proposed distributed algorithm is based on the internal model principle which has the 
advantage to be used when the communication network is time-varying, and the agent’s model are non- 
identical. Since the proposed algorithm involves the use of graph, a brief discussion on the graph notation is 
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discussed in the beginning of this section. The problem definition and the proposed algorithm are presented 
subsequently. 


2.1. Graph notation 

The network of N nodes described by a time-varying graph is G(t) = (V,€(t)) with vertex set 
V = {1,2,---,N} and edge set E(t) © V x V. When the graph sequence G(t) is a directed graph, the pair 
Gj, i) € E(t) if node j points towards node i at time t. On the other hand, when the graph sequence G(t) is 
undirected, (j,i) € E(t) if node j is adjacent to node i at time t and vice versa. The set of neighbors of node i 
at time t is Nj(t):= {j E€ V: J, i) € E(t),i + j}. We define d;(t) = |M (t)| to be the number of neighbors 
of agent i at time t. 
Remark 1. The largest possible d;(t) for alli € ZY, t > 0, is N — 1. 
The union of the graph sequence G(t) at the time interval [t,, tp] is defined as Ge =(Y, UP E(t)). 

The adjacency matrix A(t) = [ajj(t)] of G(t) is the Nx N matrix whose (i,j) entry is 1 if 
(Gj, i) E E(t), and O otherwise. When the graph G(t) is undirected, then a;;(t) = ajı(t) for all i,j E€ V. The 
standard Perron matrix P(t) = [p;;(t)] associated with to graph G(C) is defined by 


_fl-y-4@ i=j, 
pi =| y-a) iF], 


where y = z., and this imply that the off-diagonal term of P(t) is either Z or 0. By the above construction, it 


is easy to see that P(t) is a nonnegative matrix and its row sum equals one, implying that P(t) is a stochastic 
matrix, and also its diagonal entries are positive. 


2.2. Problem formulation 
Consider N non-identical agents where each agent i is given by the discrete-time model (1) and (2): 


x(t + 1) = A;x;,(t) + Byu,(t), i € ZN (1) 
y(t) = C;x;(t), i € ZN (2) 


where x;(-) € R", y;(-) ERP and u;() E€ R™ are the state, output, and control signal of agent i. 
Meanwhile, A;,B;,C; are the state matrix, input matrix, and output matrix of agent i, respectively. The 
variables n; E€ Zt,p € Z*, m; € Z* are the vector’s dimension of the state x;, output y,, and control input u;. 
By non-identical agents, it means that A;, B;, C; are not necessarily similar to Aj Bj, Cj for i,j € Z^. Also note 
that the output dimension for all agents is the same and this is necessary for agents to reach output-formation. 
The assumptions on the systems' matrices are: 
Assumption (A1). The pair (Aj, B;) and (Aj, Ci) are stabilizable and detectable for all i € Z". 

This work aims to coordinate the N agents in a distributed manner to form a scalable formation via a 
scaling factor (scalable formation). Let 6; E IR? and 6; E R?” be the desired outputs of agents i and j defined 
on a local coordinate system, then the scalable formation objective is such that: 


lim(i(t) — y()) =al- 5), Vis EZ, 


where a € Rt is the desired scaling factor of the formation that is determined by a leader agent (formation 
leader). 

The communication network among agents is represented by a time-varying graph G(t) with two 
cases being considered: 
- Case leader-follower (LF). G(t) is a directed graph with x € Z™ as the leader node. This implies that 

a,j;(t) = 0 for allt > 0 andj € Z”. 

- Case leaderless undirected (LLU). G(t) is an undirected graph. 
Remark 2. Let e, E R” be a vector of zeros except its kt” entry being 1. The consequences of Case LF is 
that the kt” row of P(t) is e; for all t > 0. Thus, e, P(t) = e} holds for all t > 0. This implies that e% is the 
left eigenvector of P(t) corresponding to its eigenvalue of 1 for allt > 0. 
Remark 3. In case LLU, P(t) is a doubly stochastic matrix. Therefore, 1y is the left eigenvector of P(t) 
corresponding to its eigenvalue of 1 for all t > 0, i.e., 1yP(t) = 1y forall t > 0. 
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2.3. Proposed control input for case LF 
The proposed control input to the it? agent (1)-(2) is based on the internal model principal approach 
discussed in [3]: 


uj(t) = KiX;:(t) + La,Ai(t) + Luwi Ct), i € ZY (3) 


where K; € R*™i is a feedback matrix designed such that (A; + B;K;) is Schur for all i € Z”, while 
Ly, E R™ and Ly, E R™* ” are the feedback vector and feedback matrix from the internal model principle. 
The state 2; € IR” in (3) is the estimate of x; obtained from 


R(t + 1) = Afi (t) + But) + HO — yi), ie ZN (4) 
Jit) = CX, (t),i € Z" (5) 


where ĵ; is the estimated output, and H; E€ R”i* P is feedback matrix designed such that (A; + H;C;) is Schur 
for all i € Z”. The variables A; € R and w; € R? in (3) are the states of reference generator, with A; as the 
formation scaling factor variable of agent i and w; as the common trajectory among all agents. The dynamics 
of J; and w; are 


Alt+1) = Dapy OYE, iez (6) 
wi(t +1) = S X} py (t)wjCt), iez” (7) 


In (6)-(7), pij (t) is the entry of the Perron matrix P(t). In (7), S is the state matrix of the reference 

generator state w; with the following assumption made on S: 
Assumption (A2). Eigenvalues of S lie on the unit circle. 
Meanwhile, the assumption on graph G(t) for case LF is. 
Assumption (A3). The graph G(t) is uniformly connected. 
The graph sequence G(t) is said to be uniformly connected if there exist a finite time horizon T > 0 such that 
for all t, the graph G£+" contains a spanning tree, or that there exist a directed path from at least one node to 
every other node. (A3) is a standard assumption on consensus problem under directed time-varying graph 
(see for example [35] and [1]). 

To achieve scalable formation, the output of agent i will be made to track its reference-output defined 
in (8): 


Yiref t) = 5A; (t) + Qw: Ct) (8) 


where ô; E€ R? (6; # 6;; i,j € Z”) is the desired output of agent i defined on the local coordinate system and 
Q is the output matrix of w;. Refer to [35] and [1], under (A3), the N agents (6)-(7) reach consensus 
exponentially, or that 2;(t) > Aq and w;(t) > W(t) exponentially for all i € Z", where W(t) is a solution to 
Wo(t + 1) = Swo(t) for some wọ €R”. Since 4;(t) >A. and w;(t) > w(t)for all i € Z, then 
Yi,rer (t) > Sif + Qw(t) for all i € ZN. Thus, we can see that asymptotically Vy; re f(t) contains two 
components. The first component, iœ, is the formation component scalable by A% and the second 
component, Qw(t), is the common (consensus) trajectory component. 
If each y; tracks Yi ref asymptotically, which will be shown shortly, then asymptotically the outputs 

of agents i and j, i,j € Z” will differ by ,.(6; — 6;), i.e. lim Oi) — y;(t)) = (ô; — 6;) for all i,j € Z”. 
This implies that A. is the formation scaling factor. The following lemma shows that for Case LF, suppose 
node x is the leader node, then agent x is also the leader agent that can determine /,, (formation leader): 
Lemma 1. Let node k € Z™ be the leader node in the leader-follower graph G(t). Given system of N agents 
(6). Suppose (A3) holds. Then A;(t) > Ax(to) for all i € ZY, where Ax(to) is the initial condition of A, and 
to is a moving initial time (tọ is not necessarily equals zero). Lemma | is a special case of the discrete-time 
case in [2]. The proof of Lemma 1 can be shown by taking results from [2] and utilizing Remark 2. Thus, the 
proof is omitted. 

By noting that asymptotically Y; ref (t) = iño + QW(t), where Aœ is a constant and W(t) is a 
solution to the dynamics wo(t + 1) = Swo(t), to ensure each agent tracks their own reference-output, the 
feedback vector Ly, and feedback matrix Lọ, are designed via the well-known internal model principle [36]. 
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Lj, in (3) is feedback vector computed by Ly, = Ty, — Killa; where I, and II,, are the solutions to the 
following Francis equations: 


Aill; + Bil; = Mh, (9) 
CMa; = ôi (10) 


On the other hand, Ly, is feedback matrix computed by Ly, = Tw; — Killw,, where Ty, and Iwọ, are the 
solutions to the following Francis equations: 


Ailly, + BIw, = Tw,S (11) 
Cw, = Q (12) 


To guarantee solvability of (9)-(10) and (11)-(12) for any 6; and Q, the following assumption is needed [36]: 
Assumption (A4). The matrices: 


F = ln; a sad Ü = čln; “| 
C 0 CG 0 


are full-row rank for all i € Z” and for every eigenvalue # of S. 


2.4. Proposed control input for case LLU 

The case LLU is similar to Case LF. The controller, observer, dynamics of à_i and w_i are similarly 

given by (3), (4)-(5), (6)-(7). All assumptions are also needed except (A3). Instead of (A3), the assumption 
on the communication graph is 
Assumption (A5). The graph G(t) is uniformly strongly connected. 
The graph sequence G(t) is said to be uniformly strongly connected if there exist a finite time horizon T > 0 
such that for all t, the graph G{*" is strongly connected, or that there exist a directed path between any two 
nodes. Since an undirected graph is bidirectional, an undirected graph that contains a spanning tree has the 
properties that any two nodes in the graph is connected by a path, which means that the graph is strongly 
connected. Therefore, (A5) is used here instead of (A3). 

We will now discuss the method so that a designated leader agent is able to determine the formation 
scaling factor. Since case LLU is similar to case LF, except of the network topology, it can be shown that the 
consensus value of A is also the formation scaling factor. We first state the following result which is the 
undirected network version of Lemma 1: 

Lemma 2. Given system of N agents (6) where the communication graph G(t) is undirected. Suppose (A5) 


holds, then A;(t) > D Aj (to) for all i € Z™. The proof of Lemma 2 can be shown by taking results from 


[2] and noting Remark 3. Thus, the proof is omitted. 

By Lemma 2, for case LLU, the formation scaling factor is the average of the initial condition of A. To 
make the consensus value of A equal to the desired formation scaling factor, then the leader agent must know 
the initial states of A; for all i which is too demanding. Our approach is to have the leader agent to wait until 
A reach consensus with some reasonable tolerance, and then the leader agent uses its own / as the estimate of 
the A of other agents. In other words, the problem is to compute the waiting time, t, before the leader agent 
can change its A to a new value such that the error bound between the desired formation scaling factor, Ag, 
and the actual formation scaling factor, Aœ, is known. We shall assume here that although the leader agent 
updates its à using (6), the leader agent is authorized to reset its A to any real number at any time. 

Let agent k E Z be the designated leader agent. We start the analysis by first defining the choice of 
the reset value of Ax or Ax(tonew)» followed by the analysis of the error bound between Ag and Aa , and the 
computation of t. Suppose v > 0 is chosen where it is desired that |As — Aal < v. Suppose further at 
t*, |A;(t*) — A; @*)| <v for all i,j € ZN, where t* = t+ tosg» Where to,,, is the previous value of to. Let 
agent xk reset its A at t = t* to 


2% = Nå — =I (13) 


where /,,(t*) is computed from (6). The following lemma shows that |A% — Aal < v holds: 
Lemma 3. Consider system of N agents (6). Let agent k € Z™ reset its A at t = t* by (13) and suppose 
|a;(t*) — a,(t*)| < v holds for all i, j € Z”, then |A» — Aal < v holds. 
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Proof from Lemma 2, the steady state value of À is the average of its the initial states. Let the new to, to, = 
t*, we can write the following for 1;, i € ZN: 
N N 
. 1 1 * 
Lim a(t) =) lone) =| D, lonw + Ae) 
j=1 J=1,j#K 
by replacing A% using (13), we obtain 
. 1 * * 
Lim) =E (E aje YCE) + Nag — (N = 1)Ax(t*)) 
= (Nha + DMs jan (UCE) — Ae(t"))) 
1 
= Aa + einen Pj = Aw» (14) 


where pj = A,(t*) — A,,(t*). Since |A,,(t*) — a(t") 
be written as: 


< v for all j,j + k, then —v < p; < v. Thus, (14) can 


N-1 N-1 
Aa — v < ño S Aa + 
> Aa- Y S Ào S Aa ty 
> |A% — Aal < v. 


v 


Then, the next problem is to compute the value of t such that at t* = t + tosia’ EA CAD) — 4; (t*)| < v holds. 
There are two steps that will be involved to compute t. For the first step, we first write the solution of (6) for 
all agents as : 


A(t +1) = P(t) P(to)ACto) = P(t, to)A(to) 


Let T € Z be the bounded communication interval where the union of graph sequence G(t) is strongly 
connected at least every T time steps. The following lemma [28] shows that, under (A5), the difference 
between P(t, tp) and 1yq(to) decays geometrically where q(to) = [q1(to), =", Gy (to) ]' E€ R” is a stochastic 
vector. 

Lemma 4. [28] Consider the product of the stochastic matrices P(t, tg). Let (A5) be satisfied, then for each 
to = 0 there is a stochastic vector q(to) such that for all i,j E€ ZY and t > tọ. 


t-to 


1 1 
|[PCt, to)lij — qj(to)| <2 (1 -r 


The reader is referred to [28] for the proof of Lemma 4. In view of Lemma 2, we know that 
lim P(t, to) = ~1yl n - Applying this to Lemma 4 and using to iqa8 the initial time, under (A5), we get 


[PE toad, = | < u, with 
E 


u=2(0-)') (15) 


where t = t* — tosia: Hf u is known, then t can be computed using (15). The second step is to derive t as a 
function of v where EA CAD) = A(t") 

Let |[P(c*, tool “| < y, for all i,j € Z%then [PE tona)l,; —[P(t*, toa], < 2p for all 
ij, € EZ. Since each Ay(t*) = Xj-[P(t", toad], Atocia) and Alt) = DILaLP(C* tora )], Av Coora)> 


i, ? € Z”, we can write J; (t*) — Ag (t*) = fed (lP, tonal, = [P(t*, tos) |,,) Aj (tonia) thus 


T 
T 


< v for all i,j € Z", which will be given next. 
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N 
ACE") — ae s > (Pe tosia) lij = [P(t tora) Cosa) 
j 
|(IPE; tona)l;; = [PCE tona)l, | | 2; (tova)l 
2u/Aj(tora) | = 2H (A (Coal, =v 


for all i, £ € Z". Substituting (15) to the above result, we have 


n|; aloa) + 


a SE (16) 


= ((--)”) 


Remark 5. The value lA(to,, all, for toa = 9 can be estimated by defining the ball that contains the 
possible initial value of A; for all i. Whereas for the subsequent to 4 > 0, lA(to,, all, can be estimated by 


utilizing the previous value of v and A,. 


3. RESULTS AND DISCUSSION 
3.1. Main theorem and discussion for case LF 
By having all necessary results established in section 2.3, we can present the main result of case LF: 
Theorem 1: Given system of N agents (1)-(2) with control input u;(t) given by (3), observer dynamics given 
by (4)-(5), reference generator dynamics given by (6)-(7) where the communication graph G(t) is directed 
leader-follower with x € Z™ as the leader node, and reference-output Yi,ref (t) given by (8). Let Aq be the 
desired scaling factor of the formation and setting Ax(to) = Ag. Suppose (A1)-(A4) are satisfied, then: 
a. yji(t) > Yiret(t) exponentially for all i € ZN 
b. Yirer(t) > 6:Ag + QW(t) exponentially for all i € ZN, and 
c. Jim Gi — y;(t)) = àa (8; — ôj) for all i,j € ZN 
Proof: 
a. We first define (t) = x(t) — I,A: (t) — Mw,wi(t), and ¢i(t) = Èj emio y OAA -—A4@), 
Pilt) = X je mie aij Ct) (wi Ct) — w;(t)) for notational simplification. By noting the definition of p;;(t), 
we can write the time evolution of X; as: 


H(t +1) = Aix Çt) + Bi (K2 CE) + La AiE) + Ly,wi(t)) 
=m (iC) — y (0) — Mw, Swit) — Y 6:0) 
By replacing Ly, = Ta; — Killap Ly, = Iw; — Killw,, and £; = x; — €;, we can write 
i(t + 1) = (A; + B;K;)x;(t) — By Kje;(t) + B; (Ex; = K,M,,)ai(t) + (Tw; E Kil Jw) 
= (ANa; + Bila, JAi(t) + Tygi — (Ail, + Bilw,)wit) + Mwy: Ct). 
After some algebra we have 
Z(t + 1) = (A; + BK) ži (t) — Bi Kies) + ayi) + Ty, veiO 


From [1] and [35], under (A3), we know that @;(t), and @;(t) approach zero exponentially for all i € Z”. 
As for €;, we can see that: 


et +1) =x,(t+ 1) -—£,4¢ + 1) 
= Ai (x(t) — %,(t)) + H,C;(x;(t) = £.(t)) 
= (A; + H,C;)e;,(t) 
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since (A; + H,C;) is Schur for all i € Z", e;(t) approach zero exponentially. Lastly, since A; + B;K; is 
also Schur, from (17), %;(t) > 0 exponentially for all i € Z". From the above analysis, x;(t) > 
Ma; 4;(t) + Myw: Ct) exponentially. Whereas, from (10) and (12) we have Gill}, = 6; and Gilly, = Q. 
Thus, y;(t) > ô; A;(t) + Qw;(t) exponentially. And so, y;(t) > Yirer(t) exponentially for all i € Z” is 
shown. 

b. From [1] and [35], under (A3), à and w reach consensus exponentially. From [35], w converge to some 
trajectories W(t), which is a solution to the dynamics wọ (t + 1) = SWo(t). Whereas for à, from Lemma 
1, lim 4;(€) =A,.(to) and since A,(to) = Ag, therefore lim A; = Aq. Thus, yire¢(t) = 6,4; (t) + 
Qw;(t) > 6;Ag + QW(t) exponentially for all i € Z is shown. 

c. From proof a, we have that lim yi(t) = Yiref (t) holds. Whereas from proof b, we have that 
lim yi rep (t) = 5:Aa + Q(t). Therefore, lim y;(t) = 6;4q + Q(t). Evaluating lim (v(t) - y(t) = 
Aq(6; — 6;), the result is shown. (q.e.d). 

Remark 4. In Case LF, the leader node is also the leader agent that can change the formation scaling factor. 

The leader agent can change its A at any time and it is guaranteed that the scaling factor of the formation will 

converge to the A of the leader agent. 

By using algorithm proposed in this section, agents are not just achieving scalable formation, but the 
formation also moves along trajectory of W(t). By noting that eigenvalues of S lie on imaginary axis, the 
trajectory of w(t) could be in a form of step (constant) function, ramp function, and sinusoid function. When 
W(t) is a step function, it can then be used to shift the formation to a different location. Meanwhile, when 
W(t) is a ramp function, it can be used to make the formation moves continuously like a group of birds flying 
in a formation. Lastly, when W(t) is a sinusoid function, the formation can be made to move in a circle. This 
is useful when we want the MAS to perform surveillance while in formation. These three functions can be 
combined to create a composite trajectory that suit the purpose of the operation. 

We note that assumption (A4) is needed to guarantee the existence of feedback matrices Lj, and Ly,,. 
These matrices are needed to track the reference output, y;;e¢. Therefore, if assumption (A4) is violated, the 
scalable formation cannot be achieved. It is easy to see that to satisfy (A4), the rank of the matrices stated in 
assumption (A4) must be n; + p. From here, we also know that m; = p is necessary for assumption (A4) to 
be satisfied, where m; is the number of columns of B; (it is also the dimension of u; and p is the number of 
rows of C;. This means that the number of control input u; must be at least equal to the number of outputs it 
will regulate. For example, a 2-dimensional problem where y; € IR’, needs at least 2 inputs that deal with the 
dynamics in both dimensions. 


3.2. Main theorem and discussion for case LLU 
By having all necessary results established in section 2.4, we can state the main result of case LLU: 

Theorem 2: Given system of N agents (1)-(2) with control input u;(t) given by (3), observer dynamics given 
by (4)-(5), reference generator dynamics given by (6)-(7) where the communication graph G(t) is undirected 
with T as the bounded communication interval, and reference-output Yi ref (t) given by (8}). Let Ag be the 
desired scaling factor of the formation, the desired error bound between A» and Ag be v, the value t be 
computed using (16), and also let agent x be designated as the leader agent and setting A, at t* = t + tosia’? 
for all to, 2 0 following (13). Suppose (A1)-(A2) and (A4)-(A5} are satisfied, then: 

a. y(t) > Yirep(t) exponentially for all i € ZN, 

b. yirer(t) > iA + QW(t) exponentially for all i € ZN, and 

c. Jim Gi — yj(t)) = Aw (ô; — ô) for all i,j € ZY, 

where |A» — Aal < v. 

Proof: 

a. Case LLU uses the same set of control input, observer dynamics, reference generator dynamics and 
reference-output as Case LF. They only differ in the communication graph type. Therefore, the first point 
of Theorem 2 can be proven in the same way as in the first point of Theorem 1. 

b. From [1] and [35], under (A5), à and w reach consensus exponentially to some consensus value A and 
some consensus trajectories W(t). These show that y; ref (t) > ĉifo + QW (t) exponentially holds for all 
iez”. 

c. From proof a, lim y_i(t) = Yirer(t) holds. Whereas, from proof b, we have that jim Yiref (t) = fifo + 


QWCE). Therefore, lim y;(t) = 5A. + QW(t). Evaluating lim (vi(t) — yj(t)) = A.0(5; — 8), the 


premise holds. 
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Having computed t using (16), by choosing the reset value A, at t* = t + tosia following (13), from Lemma 
3 we have that |A% — Aal < v for the chosen value of v > 0. (q.e.d) 

Remark 6. For case LLU, to change the formation scaling factor, the leader agent cannot change its À to a 
new value at any time. Furthermore, the leader agent cannot determine the formation is scaling factor for the 
first consensus process. However, in case LLU, any agent can be designated as the leader agent during the 
operation as long that agent is given the authority to reset its à. In other words, the leader agent can be 
changed during the operation by just giving the reset authority to the new leader agent. 

As we will see in the numerical examples, due to the waiting time f, scalable formation under 
leaderless undirected network has longer time interval between different formation size compared to the case 
under leader-follower network. The most important step in computing t is in defining v (the error bound) and 
computing lla Cto., all, The (16) is applicable to any other systems utilizing consensus algorithm under 


time-varying undirected network. Thus, the result presented in this work can be used in general situation. 
One example is on consensus network; any agent that can compute t using (16) can estimate the consensus 
value of the network by just computing t and monitoring its own state. 


3.3. Numerical examples for cases LF and LLU 

Two numerical examples of the proposed method will be given in this subsection, one for each case. 
For both examples, N = 3, and agent 3 is chosen as the leader agent. The communication network consists of 
two graphs G, and G3 with A, = [a;;(1)] and Az = [a;;(2)] being the associated adjacency matrices. G (t) 
switches between G, and G, at every time step starting from G4, and the graphs are chosen such that G1 U G2 
satisfies (A3) and (A5) for each example. The system considered in both examples are the discrete-time 
version of the ones used in [3]: 


1 0.0998 0.0047 1 0.0978 0.0123 
A, = |0 0.9953 0.0905 A, = |0 0.9387 aa 
0 0.0905 0.8144 0 0.3667 0.4987 
1 0.0995 0.0296 ( 1 
A3 =|0 0.9852 osei), a = pie o|. c= lo = l 1) 
0 0.0490 0.9558 


Where B, = B} = B; = B, Cy = CG = C3 = C, and Q = ]. Additionally, 6, = 1,6, = 2 and 63 = 5. 
The first state in the system above is the position, while a ee and third states are the velocity and the 
actuator state. With the above choice of systems, (A1), (A2) and (A4) are satisfied by all agents. The 
formation in the examples is a one-dimensional formation since the output of agents has dimension of 1. 
Two-dimensional formation is possible when the output of agents has dimension of 2. Simulation result for 
each case will be given in the following. 


3.4. Numerical example for case LF 

For Case LF, a42(1) = a23(2) = 1, while other entries of A, and A, are zeros. Note that G, U G2 
is uniformly connected which satisfies (A3), and node 3 is the leader node. To show how the proposed 
method perform to achieve the scalable formation determined by agent 3, the initial condition for x, £, w and 
À are set to be arbitrary except for initial state of Az which are A3(0) = 1, A3(150) = 2 and A3(300) = 0.5. 
Figure 1 shows the time evolution of y; and we can see that agents track a ramp trajectory with the same 
gradient. Agents are also separated by a(d; = ô;), i,j € Z3, where a € R* is determined by agent 3. 


3.5. Numerical example for case LLU 

For Case LLU, a,2(1) = a21(1) = a23(2) = a32(2) = 1, while other entries of A, and A, are 
zeros. Note that G, U G2 is uniformly strongly connected which satisfies (A3). Meanwhile, the bounded 
communication interval T = 2. To show how the proposed method perform to achieve the scalable formation 
determined by agent 3, the initial condition for x,, and w are set to be arbitrary. The initial condition for 
A,(0), i € Z? is specified to be within a ball of B(0,0.1). Let Aq be 5 and 10 and let the actual formation 
scaling factor to be 0.9 Aqa < Aœ < 1.1 Ag. These imply that the first v = 0.5 and the second v = 1. To 
compute the first and the second t, we use |A(ty)|, = 0.3 and |A(t)|, = 16 respectively. 

From the data above, the first t = 1280 and the second t = 6060. Therefore, we set A3(t,) for 
to = 1280 and ty = 7340 following (13). Figure 2 shows the time evolution of y; and we can see that agents 
track a ramp trajectory with the same gradient. Agents are also separated by a(6; — ô;), i,j € Z?, where 
a € R* is determined by agent 3. 


Formation control of non-identical multi-agent systems (Djati Wibowo Djamari) 


2730 O ISSN: 2088-8708 


. 
san 
sssnnntt"” 
aoe 
. 


> 
. 
. 
. 
. 
. 
. 
. 
. 
. 
. 
. 
* 


sans 
-- . PELLS + 
Los N tena e nen eneeee 


. 2 \ aie 
peneueneeenee a ‘N - 


60 | | 
cuceeneeeeeuemenneeeeanenenneneny 
50 Yy —— WY, "nuun y, : i 
: 
z 
40 FH l | 
= | CEEE EJ seeeeeee ane one anus supeeeee oo ot 
: I 
20 H s | | 
i l 
z 
10 H r 5 | 
Aree | 
L f i i i 
0 2000 4000 6000 8000 10000 12000 


Figure 2. Plots of y; (t) againts t for case LLU 


4. CONCLUSION 


Scalable formation for non-identical linear system agents under a time-varying network has been 
presented in this paper for two graph cases. The first graph case is LF or leader- follower network. This is the 
common network setting used in any other scalable formation problem. Leader-follower network makes the 
formation scaling adjustment easy during the operation since the formation size will always follow the initial 
state of the leader’s virtual state (A). The leader agent can change the formation size at any time. However, 
changing leader agent during operation is not possible due to the leader- follower network and spanning tree 
restriction. The second graph case is LLU or leaderless undirected network. This is the first work on scalable 
formation that uses undirected network, and it allows for a leader agent to determine the formation size. 
Although the leader agent cannot change the formation size any time (it needs to wait for t), the leader agent 
can be changed during the operation, and it is useful when the leader agent is damaged or having technical 
issues. The presented method can also be used to estimate the consensus value of an undirected network for 
more general problem. 

The scalable formation is achieved, via internal model principle, by having agents track a 
reference-output with bias. The bias has two components which are 6; and A;. The former is a static 
component where the desired output of agents is specified, whereas the latter is a dynamic component 
updated in a distributed manner. To make the reference- tracking possible, feedback matrices need to be 
computed and the system’s matrices must satisfy certain requirements (assumptions (Al) and (A4)). The 
contribution from this work, scalable formation formulation for non- identical agents, makes formation with 
size adjustment possible to be developed using different type of agents (with different mathematical model). 
Several types of agents can work cooperatively to be used in surveillance, searching mission, and defense 
formation. Thus, the proposed method opens more application for MASs. We take note that the 
output-feedback internal model principle used in this work is not robust to model mismatch. A robust 
scalable formation for non-identical agents is a possible future research direction. 
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